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New  Haven,  Connecticut  0651.0 , USA 


Abstract 

This  paper  considers  the  possibilities  for 
knowledge-based  automatic  text  translation  in  light  of 
recent  advances  in  Artificial  Intelligence.  It  is  argued 
that  competent  translation  requires  some  reasonable  depth 
of  understanding  of  the  source  text,  and  in  particular, 
access  to  detailed  contextual  information.  The  following 
Machine  Translation  paradigm  is  proposed:  First,  the 
source  text  is  analyzed  and  mapped  into  a language-free 
conceptual  representation.  Then,  inference  mechanisms 
apply  contextual  world  knowledge  to  augment  the 
representation  by  adding  all  information  that  was  implicit 
in  the  input  text.  Finally,  a natural-language  generator 
maps  the  most  relevant  sections  of  the  language- free 
representation  into  the  target  language.  We  illustrate 
this  process  with  examples  of  English-to-Spanish  and 
English-to-Russian  translations,  including  some  produced 
by  SAM,  a working  system  of  computer  programs  at  Yale 
University . 


(•)  This  author  is  presently  a member  of  the  Computer  Science 
Department  at  Carnegie-Mellon  University,  Schenley  Park, 
Pittsburgh,  Pennsylvania  15213.  USA. 

(*•)  This  author  is  presently  a member  of  the  Electrical 
Engineering  and  Computer  Science  Department  at  The  University  of 
Connecticut,  Storrs,  Connecticut  06266,  USA. 

This  work  was  supported  in  part  by  the  Advanced  Research  Projects 
Agency  of  the  Department  of  Defense  and  monitored  by  the  Office  of 
Naval  Research  under  contract  NOOOU-75-C-1 1 1 1 


f 

• 

; 

i 

i 


i i 


— * 


f 

- 2 - 

1 . 0 Introduction 

Detailed  understanding  of  the  source  text  is  necessary  for 
reliable  machine  translation  (MT).  This  paper  discusses  a 
paradigm  for  knowledge-based  MT  and  reports  work  in  implementing 
such  a system. 

In  a survey  of  the  early  machine  translation  efforts, 

Bar-Hillel  (I960)  conjectured  that  the  problem  of  competent 
automatic  translation  of  a text  is  equivalent  to  the  problem  of 
full  understanding  of  that  text.  Because  of  the  tremendous  amount 

; 

of  information  required  for  understanding,  and  the  complexity  of 
applying  such  information,  Bar-Hillel  concluded  that  machine 
translation  in  general  was  not  feasible. 

Since  the  time  of  that  survey,  several  important  practical 
and  theoretical  advances  have  been  made  which  suggest  that  some 
progress  toward  knowledge-based  machine  translation  may  now  be 
possible.  On  the  practical  side,  available  computing  hardware  is 
several  orders  of  magnitude  more  powerful  than  one  could  have 
expected  in  I960.  Further  improvements  in  memory  sire  and 
processing  speed  are  on  the  way.  More  significantly,  we  have 
greater  knowledge  about  memory  organization  and  accessing 
techniques  for  "intelligent"  systems  than  we  did  in  I960. 

One  important  class  of  theoretical  contributions  from 
Artificial  Intelligence  (AI)  research  has  been  concerned  with 
representation  of  meaning . Wilks'  Preference  Semantics  (Wilks 
1976),  Schank's  Conceptual  Dependency  theory  (Schank  1575), 
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(ftiegr-r  1975b),  and  Semantic 
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Rieger's  commonsense  algorithms 
Network  representations  (Quillian  1968,  Collins  and  Ouillian  1972, 
Hendrix  1975)  are  representative  examples.  Several  theories  of 
commonsense  inference  in  natural  language  processing  have  been 
proposed  (Charniak  1972,  Rieger  1975a).  Perhaps  most  importantly, 
methods  for  structuring  and  applying  large  amounts  of  world 
knowledge  have  been  devised,  including  Frame- systems  (Minsky  1975, 
Charniak  1976),  Knowledge  Representation  Language  ( KRL : Bcbrcw 
and  Winograd  1977),  and  Knowledge  Structures,  including  Scripts, 
Plans  and  Goals  (Schank  and  Abelson  1977). 

Using  representational  constructs  such  as  the  above,  AI 
researchers  have  had  a fair  degree  of  success  in  devising  working 
analyzers  (Woods  1972,  Winograd  1972,  Riesbeck  1975,  Marcus  1977) 
and  generators  of  natural  language  sentences  (Goldman  1975, 
McDonald  1977).  In  several  instances  these  modules  have  been 
combined  with  a "memory"  model  to  create  complete  understanding 
systems  that  achieve  a reasonable  depth  of  comprehension  in 
limited  domains.  Examples  of  complete  systems  include:  SOPHIE 
(Erown  and  Burton  1975)  and  SCHOLAR  (Carbonell  and  Collins  1973), 
which  apply  Semantic  Networks;  NUDGE,  the  Frame-based  scheduling 
assistant  (Goldstein  and  Roberts  1977);  PAM,  which  uses  Plans 
(Wilensky  1976);  SAM,  the  Script-based  story  understander 
(Cullingford  1975,  1976),  described  later  in  this  paper;  and 
POLITICS,  which  combines  many  different  types  of  knowledge 
structures  (Carbonell  1976). 


- H - 

These  developments  have  triggered  a revival  of  'nterest  in  MT 
in  the  A1  community.  For  example,  Wilks  constructed  an 
experimental  English-French  MT  system  (Wilks  1973),  based  on  the 
Preference  Semantics  scheme  for  representation  and  inference. 
Although  "practical"  MT  has  not  been  a primary  working  goal  with 
SAM  and  the  other  understanding  systems  at  Yale  University,  a 
rudimentary  capability  for  Engl ish-Mandarin  Chinese  translation 
(Stutzman  1976)  was  incorporated  in  an  early  version  of  SAM,  and 
more  recent  efforts  have  produced  limited  English-Spanish 
translations  of  newspaper  articles.  We  have  continued  to 
experiment  in  SAM  with  translating  newspaper  reports  into  Dutch, 
Russian  and  Arabic. 

This  paper  examines  some  important  problems  in 
knowledge-based  Ml  in  the  light  of  our  recent  experience  with 
these  programs.  We  outline  possible  solutions  to  some  cf  tnese 
problems,  using  examples  of  Spanish  and  Russian  translations  of 
English  newspaper  articles  read  by  SAM.  Then  we  proceed  to 
describe  how  Scripts,  as  a form  of  episodic  context . comprise 
necessary  part  of  a process  model  for  accomplishing 
meaning-preserving  translation. 

2.0  Translation  Understanding 

2.1  Knowledge  Sources  for  MT 

What  kind  of  knowledge  is  needed  for  the  translation  of 
texts?  Consider  the  task  of  translating  the  following  simple 
story  about  eating  in  a restaurant  from  English  into  Russien  or 
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from  English  into  Spanish: 


Story  1 : 

John  went  into  a restaurant.  He  ordered  a hamburger. 
When  the  hamburger  came,  he  ate  it. 


In  Russian  it  does  not  make  sense  to  say  "WHEN  THE  HAMBURGER 
CAME."  Instead,  a Russian  speaker  would  say  something  like:  "WHEN 
THE  HAMBURGER  WAS  SERVED"  ( byl  podan) . Similarly,  in  Spanish,  one 
would  have  to  say  "CUANDO  SE  LO  SIRV1ER0N"  (When  it  was  served  to 
him)  or  "CUANDC  SE  LC  TRAJERON"  (When  it  was  brought  to  him). 
Therefore,  direct  lexical  translation  with  syntactic  rules  is 


insufficient  to  form  a mean 

ingful 

tran 

slation  of  Story  1 

ir.  either 

Russian  or  Spanish. 

Semantic  rules  alone 

will 

not 

suffice  either. 

We  will 

postulate  some  semantic 

rules 

for 

translating  Story  1 

and  show 

that  any  such  rules  are 

in 

themselves  insufficient 

for  a 

meaning-preserving  translation. 

We 

will  demonstrate 

that  in 

addition  to  semantic  rules,  detailed  knowledge  of  the-  domain  is 
necessary  for  correct  translations. 

Let's  look  at  the  English-Russian  translation  process  first. 
We  could  try  to  solve  the  problem  caused  by  the  special  sense  of 
the  verb  "to  come"  in  Story  1 with  a translation  rule  such  as: 

Rule  1 : 

If,  in  the  clause  "X  came...  ",  X is  an  inanimate 
object,  then  translate  the  clause  as  "X  was  served". 

This  rule,  of  course,  is  much  too  simplistic.  For  example,  it 
would  not  work  in  the  following  story: 


Story  2: 

John  ordered  a book  by  mail.  When  the  book  came, 
John  realized  it  was  not  what  ne  wanted. 


Here,  a Russian  speaker  would  use  an  expression  similar  to  the 
English  "ThE  ECGK  CAME"  (kniga  prishia) . We  could  try  to  fix 
Ruie  1 by  requiring  the  subject  of  the  verb  "to  come"  tc  be  "food" 
instead  of  just  "an  inanimate  object",  but  this  would  not  help, 
since  if  in  the  same  story  John  had  ordered  an  extra  piste,  a 
Russian  speaker  would  have  still  usee  the  phrase  "1HE  PLATE  WAS 
SERVEL"  . 


Tne  important  factor  in  the  above  examples  is  the  context  in 
which  the  verb  "to  come"  appears.  The  fact  that  the  story  takes 
place  in  a restaurant  context  is  crucial  for  the  correct 
translation  of  Story  1.  Thus,  we  could  modify  Ruie  1: 

Rule  2: 

if  we  encounter  the  clause  "X  came  ...  " ir.  a 
restaurant  context,  translate  it  as  "X  was  served"  if 
X is  an  inanimate  object. 


There  are,  however,  difficulties  even  with  this  rule.  Consiaer 
the  following  story: 


Story  ?: 

John  and  Mary  ordered  lobster  in  a restaurant.  The 
waiter  told  them  a shipment  of  fresh  lobsters  was 
expected  any  minute.  They  decided  to  wait.  When  the 
lobsters  came,  the  waiter  asked  them  how  they  should 
be  prepared. 


Here,  the  phrase  "WERE  SERVED"  is  a totally  inappropriate 
translation  of  "WHEN  THE  LOBSTERS  CAME".  In  order  to  translate 
Story  3 correctly,  we  have  to  know  not  merely  that  we  are  in  the 
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restaurant  cc  text,  but  also  what  exactly  can  happen  in  a 
restaurant.  In  our  example  we  need  to  know  who  can  serve  whom, 
and  under  precisely  what  circumstances  "serving"  can  take  place. 
Thus,  we  could  further  modify  Rule  2: 


Rule  3: 

Use  the  Russian  verb  for  "to  serve"  only  when  a 
waiter  brings  something  to  a customer. 

Rule  3 win  suffice  for  certain  simple  restaurant  stories,  but, 
unfortunately,  it  is  not  applicable  even  to  our  original  story. 
Story  1 does  net  say  explicitly  who  brought  the  hamburger  to  whom. 
Tc  apply  Rule  3 in  translating  the  story,  we  neeo  a detailed 
account  of  what  can  actually  happen  in  a restaurant . Story  1 does 
not  say  that  it  was  the  waiter  who  brought  the  hamburger  . It  is 
our  knowledge  of  restaurants  that  supplies  us  with  this 
information.  For  instance,  we  would  need  to  know  something  like: 


If  something  which  is  normally  served  by  the  waiter 
(e.g.,  food,  utensils,  etc.)  arrives  at  the  location 
of  the  customer,  then  it  was  probably  the  waiter  who 
brought  it  there. 


Only  by  applying  this  information  in  conjunction  with  Rule  3 can 
we  arrive  at  a correct  Russian  translation. 


Analogous  problems  arise  in  formulating  a Spanish  translation 
of  Story  1.  In  Spanish  one  must  specify  the  recipient  case  in  the 
lexical  realization  of  sentences  containing  verbs  such  as  "to 
serve".  In  our  example,  a Spanish  speaker  must  say  the  equivalent 
of  "THE  HAMBURGER  WAS  SERVED  Ifc  HIM"  (or  "TO  JOHN").  This 
presents  a serious  problem  in  that  the  original  English  sentence 
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gives  no  hint  as  to  who  received  the  hamburger.  The  solution  is, 
oner  again,  to  apply  knowledge  about  the  workings  of  a restaurant. 
We  must  know  not  only  who  serves  the  food  but  also  who  receives  it 
in  the  restaurant  context:  If  food  is  brought  to  somebody  in  a 
restaurant,  then  that  person  is  probably  the  customer.  Inus,  if 
we  had  earlier  identified  John  as  the  customer,  we  could  apply 
this  rule  to  identify  him  as  the  recipient  of  the  hamburger,  and 
produce  the  proper  Spanish  translation. 

The  main  conclusion  to  be  drawn  here  is  that  competent 
translation  of  even  simple  texts  requires  (among  other  things!; 
access  to  a source  of  detailed  knowledge  concerning  what  usually 
happens  in  situations  such  as  eating  in  restaurants,  including  the 
order  in  which  things  happen,  and  the-  roles  that  various  actors 
play.  We  model  this  type  of  episodic  knowledge  for  the  computer 
in  r data  structure  called  a Script  (Schank  and  Abelscr  1577). 
Section  2.2  gives  a more  detailea  analysis  of  the  kinds  of 
knowledge  needed  to  solve  the  problems  arising  in  the  translation 
of  simple  newspaper  stories.  Section  3 describes  our  solution  to 
some  of  these  problems,  as  embodied  in  a working  computer  program 
called  SAM. 
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2.2  Translating  Newspaper  Articles 

Our  translation  paradigm  is  graphically  illustrated  in 
Figure  1 . 


- 


A 


Contextual  Knowledge 
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Language  1 > • * > Language  1 

* Language-free  * 

Language  2 > • Conceptual  Representation  * > Language  2 

• • 

Language  3 > *****«**•<*••»*•*•••#*•§•***#»  > Language  3 

ANALYSIS  GENERATION 

Source  Text  Internal  Representation  Target  Text 

Figure  1 


First,  we  analyze  the  source  text  into  a language-free  semantic 
representation.  Second,  we  apply  contextual  knowledge  of  the 
subject  matter  to  fill  in  the  things  that  were  left  unsaid  in  the 
source  text.  The  meaning  structures  for  the  sentences  of  the 
text,  together  with  the  interconnecting  inferences,  comprise  a 
complete  "conceptual"  representation  of  the  text.  Third,  we 
choose  appropriate  parts  of  the  conceptual  representation  ana 
express  them  in  the  target  language,  lr.  this  paradigm,  the  same 
language-free  representation  is  used  to  generate  many  different 
target-language  realizations.  Because  the  contextual  knowledge  is 
language  independent  (with  some  exceptions  which  will  be  discussed 
later),  it  can  be  used  to  translate  stories  in  a given  domain 
between  different  pairs  of  languages.  This  translation  paradigm 
is  Implemented  in  the  SAM  system. 
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Let's  consider  some  of  the  kinds  of  knowledge  that  need  to  be 
applied  to  produce  correct  translations  of  simple  English 
"newspaper"  stories  into  Spanish  and  Russian. 

Story  4 

(4a)  An  automobile  hit  a tree  Friday  evening. 

(4b)  David  Hall,  27,  a New  Jersey  man,  died. 

(4c)  Frank  Miller,  the  driver,  was  slightly  injured. 

(4d)  The  police  department  did  not  file  charges. 

To  translate  the  first  sentence  from  English  tc  Spanisn,  we 
must  disambiguate  the  word  "hit".  There  are  at  least  three 
different  Spanish  verbs  meaning  "to  hit",  and  each  is  much  more 
specific  than  the  English  verb.  The  translator  must  choose  amcr.g 
the  three  alternatives: 

PEGAR  — The  actor  (usually  the  subject  in  the  active  voice) 
applies  a force  to  the  object  with  the  explicit  intention 
of  realizing  a state  change  in  the  object.  Hence,  the 
actor  must  be  a human.  There  is  a strong  expectation 
that  the  object  is  animate  (i.e.,  that  it  will  feel  the 
force  and  its  aftereffects).  For  example,  the  sentence 
JOHN  JiH  MARY  will  be  translated  as  JUAN  LE  PEGC  A MARIA. 

GOLPEAR  — In  default,  it  is  assumed  that  the  actor  is  higher 
animate  (e.g.,  human).  There  is  a strong  expe  taticn 
that  the  actor  will  propel  an  instrument  whose  physical 
state  will  remain  unchanged,  and  that  the  object  with 
which  the  instrument  comes  into  physical  contact  may 
suffer  a state  change.  No  restrictions  are  placed  on  the 
object.  Thus,  the  sentence  JOHN  HIT  THE  NAIL  WITH  A 
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HAMMER  will  be  translated  as  JUAN  GOLPEO  EL  CLAVC  CON  UN 
MART1LLC. 

CHOCAR  — It  is  assumed  that  the  actor  did  not  intend  for  the  act 
to  occur.  There  is  an  expectation  that  the  actor  will  be 
self-propelled  but  mindless.  There  is  a strong 
expectation  that  the  actor  suffers  a state  change,  but 
only  a weak  expectation  that  the  object  does  likewise. 

A Spanish  speaker  will  use  the  verb  CHOCAR  to  translate  the 
first  sentence  of  the  Story  M: 

(l  I El  viernes  al  anochecer  un  auto  chocc  contra  un  arbcl. 

Thus  , in  order  for  the  Spanish  generation  module  of  an  MT  system 
to  properly  select  the  verb  CHOCAR,  it  must  have  access  to  a 
memory  representation  for  Story  A which  contains  information  about 
the  intentionality  of  entities  such  as  vehicles  and  trees.  In 
SAM,  the  language-free  representation  built  for  this  type  of  story 
includes  the  necessary  information.  Section  3 describes  hew  this 
representation  is  created,  and  how  SAM's  Spanish  generator  uses  it 
to  produce  the  correct  translation. 

A similar  disambiguation  problem  is  encountered  when  one 
tries  to  translate  sentence  (Aa)  into  Russian.  We  need  to  apply 
information  about  cars,  trees,  and  car  accidents  to  choose  the 
Russian  verb  NALETET'  as  the  correct  and  appropriate  translation 
of  the  English  verb  TC  HIT.  (NALETET'  literally  means  "to  fly 


onto.") 
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Translating  sentence  (4b)  into  Spanish  presents  no 
verb-selection  problem  regarding  the  state-change  of  a person 
going  from  being  alive  to  being  dead.  A Russian  speaker,  however, 
will  have  to  choose  among  UMER  (died;,  SKONCHALSYA  (passed  away), 
or  EYL  UE1T  (was  killed)"  to  express  a person's  death  in  a 
particular  context.  The  first  verb  is  not  sufficiently  "formal" 
to  be  used  in  a newspaper  account.  The  second  will  be  used  if  the 
death  occurred  some  time  after  the  accident.  The  third  expression 
will  be  used  if  the  death  occurred  during  the  accident.  Ir,  all 
cases,  the  Russian  sentence  will  be  incomplete  unless  the  place, 
the  time,  or  some  ether  circumstances  of  the  death  are  given. 
Once  again,  a detailed  representation  must  be  available  of  what 
actually  happened  in  the  accident  , derived  from  a "complete" 
understanding  of  Story  4. 

"Fuii"  understanding  is  also  needed  to  complete  the 
translation  of  sentence  (4b).  We  must  resolve  that  the  number 
"27"  in  the  English  is  the  age  of  David  Hall  in  years,  since  this 
must  be  made  explicit  in  Spanish  and  Russian.  Similarly,  the 
second  appositivc  to  David  Hall's  name,  "a  New  Jersey  man",  must 
be  interpreted  as  a reference  to  Hall's  residence.  A literal 
Spanish  translation  of  "a  New  Jersey  man"  into  Spanish  (i.e.,  Uh 
HOMBRE  DE  NUEVA  JERSEY),  may  be  easily  misunderstood  to  mean  "a 
man  who  was  born  in  New  Jersey."  Furthermore,  the  literal  Russian 
translation  "chelovek  iz  New  Jersey"  is  not  only  ambiguous,  but 
also  totally  inappropriate  in  a car  accident  context.  The 
understander  realizes  Hall  is  no  longer  alive,  and  this  enables 
the  choice  of  the  past  tense  of  the  Spanish  verb  VIV1R  and  Russian 
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(to  reside).  We  want  to  say  "who  lived  in  New  Jersey" 
rather  than  "who  lives  in  New  Jersey". 

The  Spanish  and  Russian  versions  of  (4b),  then,  are  as 
follows : 

(6)  David  Hall,  de  27  anos,  que  vivia  en  Nueva  Jersey,  se 
murio . 

(7)  David  Hall,  27  let,  prozhivavshii  v shtate  N'yu-Dzhersi , 
skonchalsya  (na  meste  proisshestviya)  . 

In  (?),  the  parentncsized  phrase  means  "at  the  scene  of  the 
accident",  a setting  specification  needed  to  complete  the  Russian 
sentence  . 

New  consider  the  phrase  WAS  INJURED  in  sentence  (4c). 
Spanish  gives  a choice  of  the  expressions  ESTABA  HER1D0  and  GUEDC 
HERIDG.  One  can  say  ESTABA  HEfiiDO  if  there  is  no  known  connection 
between  "being  hurt"  and  the  actions  described  in  the  story,  cr 
QUEDG  HERIDG  if  "oeing  hurt"  was  a result  of  these  actions.  If 
the  context  of  automobile  accidents  is  not  considered,  we  may  miss 
a causal  connection  between  the  injury  and  the  events  described 
earlier.  That  is,  we  may  assume  that  Miller  was  injured  before 
the  other  events,  rather  than  as  a result  of  an  event  in  the 
story.  Yet,  it  is  clear  to  readers  of  (4c)  that  Miller  was 
injured  as  a result  of  the  car  (in  which,  by  inference,  he  was 
riding)  hitting  the  tree.  Such  causal  information  is  part  of  the 
episodic  representation  built  by  SAM,  which  is  used  by  the  Spanish 
generator  to  produce  the  following  translation  of  (4c): 
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(8)  El  chofer,  Frank  Miller,  que^o  un  poco  herido. 


The  Russian  version  o f (4c)  is: 


(9)  Vodidel',  Frank  Miller,  seriozno  ne  postradal. 


The  appropriate  translation  of  the  phrase  WAS  SL1GH1L1  INJURED  in 
this  context  is  SERIOZNO  NE  POSTRADAL  which  literally  means  "did 
not  seriously  suffer".  The  verb  and  the  negative  mode  are 
normally  used  in  accident  descriptions  to  contrast  with  the  very 
strong  statement  ("David  Hall  died")  in  the  previous  sentence. 
Thus,  the  program  which  generates  (9)  has  to  knew  not  only  what 
happened  in  the  stony,  but  also  what  it  said  pneviously. 


Sentence  (4d),  "the  police  did  not  file  charges,"  cannot  be 
translated  directly  into  either  Spanish  or  Russian.  In  this 
situation,  a Spanish  speaker  would  have  tc  say: 


(10)  La  policia  no  accuso  a nadie. 


which  literally  means  "the  police  did  not  accuse  anybody"  (•).  lr. 
many  Spanish-speaking  countries,  the  legal  procedure  following  a 
serious  accident  calls  1 or  the  police  to  make  an  accusation  very 
shortly  after  the  accident  if  they  believe  a crime  may  have  been 


committed.  There  is  then  a period  of  several  days  during  which 
the  police  conduct  an  investigation,  prior  to  the  formal  filing  of 


(•)  Sentence  (10)  literally  means  "the  police  did  not  accuse 
nobody."  In  Spanish,  double  negatives  are  used  to  emphasize  that 
an  expected  event  did  not  occur.  That  is,  although  the  reader  of 
Story  4 may  have  expected  somebody  to  be  accused  of  a wrongdoing, 
there  was  no  accusation  made. 
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charges.  Hence,  a newspaper  account  of  a recer.‘  accident  would 
not  normally  refer  to  the  prosecution  procedure,  but  to  the 
accusation.  In  English-speaking  countries,  an  official  accusation 
must  be  accompanied  by  initiating  a formal  prosecution,  without  a 
delay.  Accordingly,  competent  translation  of  (4d)  into  Spanish 
requires  that  the  translator  understand  the  difference  between  the 
knowledge  structures  describing  accusations  and  prosecutions  in 
English-  and  Spanish-speaking  countries. 

A Russian  speaker  will  have  even  more  serious  difficulties  in 
translating  sentence  (4d)  . We  consulted  six  native  speakers  of 
Russian,  two  of  whom  are  professional  translators.  All  were 
puzzled  as  to  how  this  sentence  should  be  translated.  The 
difficulty  here  is  tnat  in  Russia  the  police  never  file  charges 
themselves.  Accusations  can  be  made  only  by  the  prosecutor's 
office.  The  role  of  the  police  in  such  cases  is  to  gather  ana 
present  the  available  evidence . Moreover,  the  fact  that  no 
charges  were  filed  sounds  very  odd  to  a Russian.  Someone  must 
always  be  at  fault,  and  the  authorities  will  always  find  out  who. 
Otherwise , a Russian  would  normally  assume  that  a cover-up  must 
have  taken  place.  Thus,  a Russian  who  understands  English  but  is 
not  familiar  with  the  Western  judicial  system  will  not  even 
understand  the  meaning  of  sentence  (^d).  When  forced  to 
translate,  he  will  do  it  literally: 

(11)  Polizia  ne  pred'yavila  nikakikh  obvinenii. 

This,  native  speakers  agree,  does  not  mean  much  to  a Russian. 
Sentence  (11),  although  syntactically  and  even  semantically 
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correct,  does  net  correspond  to  any  knowledge  structure  in  the 
mind  of  the  Russian  listener.  A Russian  speaker  who  is  familiar 
with  Western  judicial  practice  will  use  sentence  (11)  only  when 

' 

speaking  in  Russian  to  another  person  with  the  same  background 

I 

about  a car  accident  the  USA . Only  in  this  situation  will 

. 

neither  speaker  nor  listener  notice  any  anomaly  in  (11;.  A good 
translator,  of  course,  will  be  aware  of  these  cultural 
differences,  and  will  briefly  explain  them  ir.  his  translation.  In 
most  cases,  however,  the  fact  that  the  police  did  not  file  charges 

would  be  considered  unimportant,  and  hence  would  be  omitted  from 

! I 

the  Russian  translation. 

i 

2.2  Some  Conclusions 

At  this  point,  we  are  in  a position  to  make  several  claims 
about  the  necessity  for  applying  world  knowledge  ir.  MT . 

Basically,  we  believe  that  competent  translation  of  a connected 
text  is  impossible  unless  the  text  has  first  been  understood,  in 
some  reasonably  deep  sense. 

Specifically,  we  argue  that  translation  cannot  be  dene  purely 
on  the  basis  of  syntactic  manipulations,  even  if  these  are 
augmented  by  "context-free"  semantic  rules.  Every  story  is  placed 
in  some  well-defined  knowledge  domain,  and  a translator  constantly 
applies  information  from  that  domain. 

’ 

This  has  two  significant  implications.  First,  for  MT  to  be 
possible  at  all,  the  knowledge  context,  or  a large  part  of  it, 
must  be  shared  between  the  speakers  of  the  source  and  target 
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languages.  There  is  nc  sense,  for  example,  in  trying  tc  translate 
a radio  report  of  an  American  football  game  for  a Koi-Koi  or  a 
Cambodian  peasant. 


The  second  major  implication  of  the  need  for  a knowledge 
context  is  that  translation  of  isolated  sentences  is  at  best 
unreliable,  and  often  impossible,  since  single  sentences  do  net 
unambiguously  establish  the  context.  For  example,  the  Spanish 
sentence  JUAN  SE  LG  COMIO  is  translated  into  four  radically 
different  English  sentences  depending  cn  the  surrounding  context. 

(12s)  Juan  tenia  un  refuerzo  de  jamon.  Juan  se  lo  c om i o . 

(12e)  John  had  a ham  sandwich.  John  ate  it . 

(Ijs)  Juan  estaba  manejandc  su  auto  de  noche . En  el  medio  de 
la  calle  habia  un  arbol  caido.  Juan  se  lo  comio . 

(15e)  John  was  driving  his  car  at  night.  There  was  a fallen 
tree  in  the  middle  of  the  road.  Jchn  ran  into  it . 

(1ms)  Juan  estaba  buscando  su  anillo  en  el  cajon.  Aunque  el 
anillc  estaba  debajo  de  uncs  papeles,  en  su  busqueaa 
apurada  Juan  se  lo  comio . 

(14e)  John  was  looking  for  his  ring  in  the  drawer.  Although 
the  ring  was  underneath  some  papers,  in  his  hurriea 
search  Jghn  missed  1L,. 

(15s)  Juan  estaba  jugando  ajedrez  con  Pedro.  Pedro  movie  su 
aifil  al  medio  del  tablero.  Juan  se  lo  comio . 

( 1 5 e ) John  was  playing  chess  with  Peter.  Peter  meved  his 
bishop  to  the  center  of  the  board.  John  captured  it . 

1 

ft 

Thus,  in  order  to  translate  sentences  from  simple  stories  we 
first  need  to  understand  the  story.  But  what  does  it  mean  for  a 
computer  program  to  "understand"  a story?  This  question  will  be 
addressed  in  the  next  section,  in  terms  of  the  model  of  machine 
translation  embodied  in  the  SAM  system. 

II 


i 


18 


! 

! 


) 


Z-O  An  Example  si  Script-based  Machine  Translation 

SAM  (Script  Applier  Mechanism:  Cullingford  1976)  is  a system 
of  computer  programs  running  at  Yale  University  that  uses  a 
Knowledge  Structure  called  a Script  to  "understand"  newspaper 
stories  about  factual  events,  by  "factual",  we  mean  descriptions 
of  "overt"  or  "physical"  activities,  such  as  car  accidents,  plane 
crashes,  natural  disasters  and  state  visits.  More  "interesting" 
stories,  of  course,  refer  to  all  sorts  of  non-physical 
information,  such  as  political  or  ideological  beliefs  (see,  e.  g, 
Carbonell  197c).  Every  story,  however,  has  at  least  some  factual 
component  which  a Script  can  account  for. 

Scripts  are  a medium  for  representing  people's  everyday 
knowledge  about  stereotyped  activities  such  as  going  to  birthday 
parties  or  museums,  driving  cars  or  riding  the  subway.  This 
knowledge  is  expressed  in  Conceptual  Dependency  (Schank  1975),  a 
formalism  for  language-free  meaning  representation.  SAM  uses  an 
extension  of  the  Script  idea  to  encode  what  ordinary  people  know 
about  certain  kinds  of  events  they  see  in  the  newspaper.  For 
example,  the  motor- vehicle  accident  Script,  JVEHACCIDENT  (*), 
contains  a detailed  account  of  the  happenings  in  a crash  and  its 
aftermath,  including  actions  of  the  police,  ambulance  service  and 
hospital . 


(•)  In  this  paper,  upper  case  names  preceded  by  are  names  of 
Scripts.  Names  preceded  by  "4"  refer  to  Script  variables,  that 
is,  to  props  and  roles  from  the  Script.  For  example,  &BUSDRIVER 
is  the  "bus  driver"  role  from  the  bus  Script,  $BUS. 
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The  accident  Script  is  a structure  of  event  sequences,  called 
causal  chains  (Schank  and  Abelson  1977).  Each  causal  chain 
characterizes  one  well-understood  happening  ("episode";  from  the 
situation.  The  episodes  contain  well-defined  roles,  cr  Script 
variables,  which  prescribe  who  can  be  doing  what,  and  the  episodes 
are  grouped  into  scenes  by  the  importance  of  the  activity  in  the 
accident  context.  There  is,  for  example,  a "crash"  scene,  which 
describes  the  various  ways  that  a vehicle  can  go  out  of  control 
and  run  into  something.  Script  variables  in  these  events  include 
the  functions  for  "vehicle,"  "driver"  and  "obstruction." 


The  example  car-accident  story  (Story  4)  considered  in  the 
last  section  is  a shortened  version  of  a summary  formed  by  SAM. 
The  story  below  was  the  input  to  SAM,  which  generated  the 
following  English  ana  Spanish  summaries  as  part  of  its  output: 


Story  5: 

Friday  evening  a car  swerved  off  Route  69.  The  vehicle 
struck  a tree.  The  passenger,  a New  Jersey  man,  was 
killed.  David  Hall,  27,  was  pronounced  dead  at  the  scene 
by  Dr.  Dana  Blauchard,  medical  examiner.  Frank  Miller, 
32 , of  593  Foxon  Rd , the  driver,  was  taken  to  Milford 
Hospital  by  Flanagan  Ambulance.  He  was  treated  and 
released.  No  charges  were  made.  Patrolman  Robert  Oncfrio 
investigatea  the  accident. 

English  Summary: 

AN  AUTOMOBILE  hIT  A TREE  NEAR  HIGHWAY  69  FOUR  DAKS  AGO. 
DAVID  HALL,  AGE  27,  RESIDENCE  IN  NEW  JERSEY,  THE 
PASSENGER,  DIED.  FRANK  MILLER,  AGE  32,  RESIDENCE  AT  593 
FOXON  ROAD  IN  NEK  HAVEN,  CONNECTICUT,  THE  DRIVER,  WAS 
SLIGHTLY  INJURED.  ThE  POLICE  DEPARTMENT  DID  NOT  FILE 
CHARGES. 

Spanish  Summary: 

UN  AUTO  CHOCC  CONTRA  UN  ARBOL  CERCA  DE  LA  CARRETERA  69  EL 
V1ERNES  AL  ANOCHECER.  DAVID  HALL,  DE  27  ANOS,  QUE  VI VI A 
EN  NEW  JERSEY,  EL  PASAGERO,  SE  MUR10.  FRANK  MILLER,  DE  32 
ANOS,  QUE  V1V1A  EN  593  CALLE  FOXON  EN  NEW  HAVEN, 
CONNECTICUT,  EL  CHOFER,  QUEDO  UN  POCO  HER I DO.  EL 
DEPARTAMENTO  DE  POLICIA  NO  ACUSO  A NADIE. 
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The  details  of  SAM's  operation  are  given  in  (Cullingfora  197b). 

In  the  following  section  we  brifely  describe  the  various  processes 
used  to  understand  story  5.  Ihe  summaries  above  were  generated  by 
SAM;  our  analysis  below  concentrates  on  how  a portion  of  the 

i 

Spanish  summary  was  generated. 

i 

C.1  The  Understanding  Process 

The  machine-translation  configuration  of  SAM  is  sketched  ir 
Figure  2.  As  shown  in  the  figure,  Ml  is  accomplished  ir,  two  phases: 

(1)  understanding  the  source-language-  text  and  building  a 
language-free  representation  for  it;  and  (2)  formulating  a 
language-free  summary  or  paraphrase  of  the  story  from  the  memory 
representation  and  expressing  it  in  a target  language. 


1 

t 


I 


V » . 


21 


UNDERSTANDING 
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Figure  2 

Machine  Translation  Configuration  of  the  SAM  system 
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SAM  operates  during  story  understanding  as  a collection  of 
"experts",  each  module  consulting  a specialized  knowledge  source 
(see  Figure  2).  One,  called  ELI  (English  Language  Interpreter, 
Riesbeck  and  Schank  1976,  Gershman  1977),  analyzes  the  text  into  a 
meaning  representation.  A second  module,  PP-Memory,  tags  ana 
identifies  references  tc  Picture  Producers  (PPs:  people,  places  and 
things  having  a definite,  static  memory  reference).  Finally,  the 
Script  Applier  finds  the  inputs  in  its  data  base  of  Scripts. 

ELI  is  the  only  module  of  SAM  that  is  concerned  with  linguistic 
input.  It  is  a dictionary-based  system  which  uses  "expectations"  as 
its  control  mechanism.  The  input  is  read  from  left  tc  right  one 
word  at  a time.  The  meaning  and  syntactic  function  of  each  word 
have  to  satisfy  the  expectations  set  up  by  the  conceptual  structures 
built  by  previous  words  and  context  requirements.  The  Analyzer, 
however,  does  not  have  a separate  syntactic  preprocessor.  Syntactic 
constraints  are  applied  only  as  needed  tc  attain  the  main  goal  of 
the  Analyzer  - to  assemble  a semantically  well-formed  Conceptual 
Dependency  representation  of  the  input.  A detailed  discussion  of 
the  workings  cf  ELI  is  beyond  the  scope  cf  this  paper. 

ELI  builds  two  types  of  conceptualizations:  PPs  and  complex 
relations  among  the  PPs  whose  central  elements  are  the  ACTS  anc 
STATES  of  Conceptual  Dependency.  PP-Memory' s job  is  to  find  the  PPs 
in  a conceptualization  and  assign  tokens  to  them.  Tokens  are  tags 
or  handles  by  which  the  PPs  will  be  known  to  the  rest  of  SAM.  This 
module  also  supplies  tokens  for  Script  variables  which  the  Script 
Applier  encountered  in  building  the  story  representation,  but  which 
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the  story  text  did  not  mention.  The 
deal  with  is  that  of  "reference":  is 

someone  SAM  has  seen  before,  is  it 
token  known  to  the  system  (e.  g.,  a 
"President  Carter"),  or  is  it  something 


basic  problem  PP-Kemory  must 
an  input  PP  something  or 
a reference  to  a "permanent" 
well-known  person  such  as 
totally  new  to  the  system? 


The  Script  Applier  is  the  repository  for  episodic  information 
about  what  can  happen  in  a known  context.  It.  performs  the  basic 
tasK  of  "understanding"  in  SAM:  locating  an  input  in  the  currently 
accessible  parts  of  an  active  Script,  linking  the  new  input  up  with 
what  has  come  before,  and  making  predictions  about  what  is  likely  to 
be  read  next  in  the  story.  In  each  of  these  activities,  the  Script 
Applier  uses  world  knowledge  to  make  explicit  the  connections,  or 
Inferences . which  are  only  implicit  in  the  text. 

The  most  important  kino  of  inference  SAM  makes  is  filling  in  £ 
causal  chain . In  reading  Story  5,  the  structure  of  $VEHACC1DEM 
tells  the  Script  Applier  which  sequence  of  causally  connected  events 
to  select  and  instantiate  in  order  to  connect  explicitly  mentioned 
events,  ke  read  about  a crash,  then  about  a person  being  taker,  tc 
the  hospital.  How  can  these  events  be  connected?  SAM  applies  its 
knowledge-  about  car  accidents  and  the  functions  of  ambulance 
companies  (the  ambulance  Script)  to  fill  in  the  probable  causal 
relations  that  someone  saw  the  crash  and  called  an  ambulance,  that 
the  ambulance  came  to  the  scene,  that  the  ambulance  attendants 
placed  the  person  on  a stretcher  and  put  the  stretcher  into  the 
ambulance,  etc.  It  also  makes  the  crucial  connection,  never  stated 
in  the  story,  that  the  person  who  was  taken  to  the  hospital  in  story 
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5 must  have  been  injured  in  the  crash.  The  reason  it  can  dc  this  is 
because  it  "knows"  what  ambulances  and  hospitals  arc-  for,  in  the 
sense  that  the  appropriate  Scripts  connect  together  for  the  purpose 
of  aiding  people  who  are  sick  or  hurt,  and  cannot  gel  to  the 
hospital  under  their  own  power.  A necessary  part  of  filling  in 
causal  chains  is  role-instantiation:  specifying  the  necessary 
properties  a PP  must  have  to  fill  a specified  role  in  an  event.  An 
example  ol  role-instantiation  can  be  seen  in  the  summary  of  Story  5,, 
which  asserts  that  the  "police  department,"  as  the  organization 
responsible  for  investigations  and  arrests,  chose  not  to  file 
charges  ir.  this  instance. 

Another  basic  class  of  inferences  in  the  SAM  system  is 
reference  specification.  The  need  for  this  process  arises  when  a 
Script  variable  which  has  already  been  bound  to  a PP  is  mentioned  ir. 
a subsequent  input.  At  this  point  a decision  has  to  be  made:  Car. 
the  new  PP  be  an  instance  of  an  old  one?  The  classic  reference 
problem  occurs  with  pronouns,  e.  g.,  can  "he"  be  the  "John"  we  heard 
about  earlier?  In  newspaper  stories,  a more  complicated  reference 
problem  arises  because  of  what  we  call  "paraphrastic  reference": 
the  use  of  arbitrarily  complex  noun  groups  to  refer  tc  the  same  PP. 
An  example  of  this  type  of  inference  is  the  process  of  recognizing 
that  the  man  from  New  Jersey  mentioned  in  the  third  sentence  of 
Story  5>  must  be  David  Hall,  age  27. 

SAM  also  uses  the  tlme/place  setting  cf  a story  for  inferences 
about  where  things  are  happening  and  how  long  they  take.  A Script's 
causal  chains  have  associated  default  values  for  the  length  cf  time 
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they  typically  use  up,  or  where  they  would  be  expected  to  occur. 
SAM  uses  these  defaults  in  Story  5 to  infer  that  the  crash  must  have 
occurred  on  the  same  day  as  the  "swerve,"  namely  Friday  evening. 
Cars  simply  cannot  stray  from  roads  for  very  long  (on  the  order  of 
seconds)  before  encountering  an  obstruction.  Similarly,  the  crash 
must  have  occurred  "near  Route  69,"  although  the  story  does  not 
explicitly  say  so.  This  is  because  roads  are  provided  with  all 
sorts  of  nearby  objects  for  cars  to  run  into. 


i 


Finally,  SAM  makes  various  kinds  of  delayed  inferences . 
Sometimes  a story  will  leave  a point  of  interest  to  a reader  hanging 
for  a while,  only  clearing  up  the  problem  in  a later  sentence.  The 
inferences  needed  in  these  cases  have  the  nature  of  "demons" 
(Charniak  1972),  hovering  around  and  waiting  for  a feature  of  an 
input  that  satisfies  their  expectations.  In  car  accidents,  for 
example,  we  want  to  know  whether  anyone  was  killed;  if  someone  was 
hurt,  how  badly;  what  the  police  did,  etc.  In  Story  5,  although  we 

know  that  the  man  taken  to  the  hospital  was  hurt  (because  we  knew 

that  this  is  what  ambulances  are  for),  we  cannot  initially  be  sure 
hew  seriously.  Will  the  person  be  operated  on  and  spend  some  time 
in  the  hospital?  Was  he  so  badly  injured  as  to  die  there?  This 
decision  cannot  be  made  until  the  sentence  stating  "He  was  treated 
and  released"  is  interpreted,  at  which  point  SAM  concludes  that  he 
must  not  have  been  too  badly  hurt. 

The  understanding  phase  of  SAM  is  completed  when  all  the 
sentences  of  a story  have  been  read,  located  in  a Script,  and 

connected  together  by  Script-driven  inferencing.  The  results  of 
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understanding  are  stored  in  a "permanent",  language-free  memory 
representation.  Let's  look  at  some  of  the  memory  structures 
constructed  for  Story  5>. 

The  story  representation  is  placed  in  a hierarchical 
property-list  structure , accessed  through  the  global  variable 
iSTORY.  Story  5 is  represented  as  a "sequential"  (SEC) 
instantiation  of  certain  episodes  of  the  accident  Script,  each 
episode  having  a most  important  event,  or  Maincon.  Additionally, 

| 

the  story  itself  has  a Maincon  (the  crash),  and  certain  events  have 
happened  which  "interfere"  with  the  flow  of  the  Script,  or  the 
well-being  of  one  of  the  participants.  In  this  story,  the  accident 
has  interfered  with  the  health  of  "Hall"  and  "Miller",  but  Miller's 
condition,  by  inference,  was  relieved  (i.  e.,  the  interference  was 
resolved;  at  the  hospital.  (Hall,  being  dead,  can't  be  helped.) 
Interference  and  resolution  events  are  marked  in  the  story 
representation  with  PATHVALUE  1NT  and  RES,  respectively.  Finally, 
the  SCORECARD  property  in  the  story  representation  points  to  an 
event  of  especial  interest  to  the  Script,  viz.,  what  the  police  did. 


! STORY:  (SEC  SCLAES) 

? 

SCLAB3 : 

SCR1PTNAKE  $VEHACC1DENT  The  active  Script, 

s 

MAINCON  EVNT^  Story  Maincon 

SCENECONS  (EVNTU  EVNT17  EVNT33)  Episode  Maincons 

INTERFERENCE  ( ( EVNT20  EVNT26 ) Interference/resolution 

( EVNT 1 U ) ) events 

SCORECARD  (EVNT33)  An  "interesting"  event, 

what  the  police  did 

Each  event  in  the  story  representation  is  a Conceptual 
Dependency  structure  containing  a reference  to  one  of  the  primitive 
CD  ACTs  or  STATES,  and  pointers  to  the  PP-tokens  which  figured  in 
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the  event.  This  is  the  structure  representing  the  stc-y  Maincon, 
the  crash,  which  is  based  on  the  ACT  ‘PROPEL*: 


EVNTH: 

VALUE  ((ACTOR  STRUCT 0 

<=>  (•PROPEL*) 

OBJECT  PHYSO) 

TIKE  (T1ME5 ) ) 

LASTEVENT  ( EVNT ; ) 

NEXTEVENT  ( EVNT20  EVNT  14  EVNT7) 


The  Maincor  of  the 
Script:  the  crash 


The  causal  predecessor 
of  the  crash  and  . . . 
its  causal  successors, 
including  "someone  died" 
and  "someone  was  hurt" 


STRUCTO: 

CLASS  ((^STRUCTURE) 

TYPE  (*CAR*; 

SUPEi-^ET  (‘VEHICLE*) 

ELEX  (AUTOMOBILE) 

SLEX  (AUTO) 

SROLES 

(UVEhACClDENT  . 4VEH1CLE1) 
UDRIVE  . 4VEHICLE1)) 


The  PP  is  a structured 
physical  object 
A pointer  to  the  def.  of 
"car"  ir.  PP-memory 
Its  functional  superset 
An  English  lexeme 
A Spanish  lexeme 
Pointer  to  the  Script 
roles : 1 ) accidents 
2)  driving 


PHYSO : 

CLASS  (//PHYSOBJ ) 

TYPE  ( *T  REE* ) 

ELEX  (TREE) 

SLEX  (ARBOL) 

SROLES 

(UVEHACCIDENT  . 4GBSTRUCT  ION ) ) 


The  PP  is  an  unstruc- 
tured physical  object 
A pointer  to  the  def. 

of  tree  in  PP-memory 
An  English  lexeme 
A Spanish  lexeme 


Connected  to  the  crash 
inferences,  defined  by  the 


conceptualization  is  a cluster  cf 
Script,  for  events  which  causally  can 


follow  it.  Here  are  some  of  the  inferences  made  in  this  story: 


EVNT 14 : 

VALUE  ((ACTOR  HUMO 

FROM  (‘HEALTH* 
TOWARD  (‘HEALTH* 
TIME  (TIME17 ) ) 
PATHVALUE  INT 


One  result  of  the 
is  that  someone 
VAL  (*N0RM*) ) ) 

VAL  (-10))) 


crasn 

died. 


This  is 
event 


an 

in 


interference 
the  Script 


EVNT20:  Another  result  is  that 

VALUE  ((ACTOR  HUM3  somebody  was  injured. 

FROM  (‘HEALTH*  VAL  (‘NORM*)) 


) 

» 
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TOWARD  (*HEALTH*  VAL 
INC  (-3)) 

PATHVALUE  INI 

EVNT26 : 

VALUE  ((<  = > ( TREATMENT 

DOCTOR  HUM9 
PATIENT  HUM3 ) ) 
TIKE  (TIME-1)) 
PATHVALUE  RES 


(NIL))) 

Another  interference 

This  is  the  resolution 
for  EVNT20:  treatment. 

This  is  a resolving 
event . 


HUMO: 

CLASS  (^PERSON) 
SURNAME  (HALL) 
PERSNAME  (DAVID) 
GENDER  ( #MASC* ) 

AGE  (UNITO) 
RESIDENCE  (POLITO) 


SROLES  ( ($VEHACCIDENT  . ADEADGRP) 
( $DR1VE  . APASSGRP 1 ) ) 


This  is  the  person  who 
died . 


His  age  , 27  years . 

This  is  his  residence, 
the  political  unit 
"New  Jersey". 


HUM; : 

CLASS  (((-PERSON) 

SURNAME  (MILLER) 

PERSNAME  (FRANK) 

GENDER  («MASC*) 

AGE  (UNIT  1 ) 

RESIDENCE  (LOCI) 

SRCLES  (UVEHACCIDENT  . AHURTGRP) 
UDR1VE  . ADRIVER1)) 


This  is  the  person  who 
was  injured 


His  age,  32  years,  and 
where  he  lives 


I 
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Finally,  we  show  an  "interesting"  event,  the  Maincon  of  the 
investigation  episode.  This  event  is  marked  as  being  cf  special 
importance  because  of  the  SCORECARD  property: 


EVNT33:  [Onofrio  decides  not  to  prosecute] 

VALUE  ((ACTOR  HUM7  <=>  («MBU1LD*) 

MOBJECT  ( ( <=>  ( $PR0SECUT10N 
CHARGEE  (NIL) 
CHARGOBJ  (NIL) 
CHARGEE  (0RG3) ) ) 
MODE  (MM0DE3) ) 

TIME  (T1ME319) 

[MM0DE3  * (#NEG#) ] 

HUM7 : 

CLASS  (IPERSON) 


* +i\  r st 
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SURNAME  (0N0FR10) 

PERSNAME  ( ROBERT ) 

GENDER  (*MASC* ) 

TITLE  (PATROLMAN; 

OCCUPATION  (“POLICEMAN*) 

SROLES  (UVEHACCIDENT  . 4P0LSTAFF; ) 

ORG3 : 

CLASS  (#0RGANIZAT1GN) 

ORGOCC  ($PGL1CE) 

SROLES  ( ( $VEhACClDEN'T  . &POLORG) ) 


3.2  The  Target  Language  Generation  Process 

After  a source  text  has  beer,  analyzed,  understood  and 
represented  in  memory,  a version  of  the  text  must  be  generated  in 
the  chosen  target  language.  The  generation  process  may  be 
conceptually  dividea  into  two  phases:  Deciding  what  to  say,  and 
then  deciding  how  to  say  it.  We  often  want  to  say  in  the  target 
language  essentially  the  same  thing  that  was  said  in  the  source 
text.  In  ether  circumstances  it  may  be  necessary  or  desirable  to 
paraphrase  or  summarize  the  meaning  representation  before 
generating  its  target  linguistic  realization. 

Summary  and  paraphrase  by  computer  arc  tasks  containing  major 
unresolved  problems.  SAM's  summary  module  works  or,  a 
language-free  basis  with  the  "important"  events  stored  in  the 
memory  representation,  such  as  the  Maincon,  interference  events 
and  events  on  the  scorecard.  It  is  not,  however,  free  of  such 
problems  as  culture-dependent  judgments  concerning  the  importance 
of  events,  or  the  idiosyncratic  viewpoint  of  the  reader.  As  such, 
SAM's  summarizer  must  be  considered  as  only  a first  step  toward 
the  solution  of  the  problem  of  automatic  summary/papaphrase . A 
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more  detailed  description  of  how  SAM  does  summary  ar.d  papaphrase 
is  given  by  DeJong  and  Stutzmar.  in  (Schank  and  Abelson  1977). 

In  this  paper,  we  emphasize  the  problem  of  how  to  generate 
natural  language  text  from  conceptual  descriptions  of  events, 
assuming  that  the  summary  module  has  already  done  its  job.  We 
have  built  generators  of  varying  degrees  of  sophistication  for 
Spanish,  Mandarin  Chinese,  Russian,  Arabic  and  Dutch.  This 
section  will  concentrate  on  Script-based  Spanish  generation. 

The  first  event  from  the  story  representation  to  be  expressed 
in  the  summary  is  the  Maincor,  of  the  accident  Script,  EVNTt  from 
the  story  representation.  An  "expanded"  Conceptual  Dependency 
(CL)  representation  for  this  event  is: 

(16)  [A  car  hit  a tree  Friday  evening.) 

( ( ACTCR  (^STRUCTURE  TYRE  (*CAR*) 

SUPERSET  (^VEHICLE*) 

TOKEN  (STRUCTO)) 

<=>  (*PR0PEL«) 

OEJECT  (IPHYSOBJ  TYPE  (#TREE*) ) ) 

TIME  ((WEEKDAY  FRIDAY)  (DAYPART  EVENING))) 

Spanish  translation: 

(17)  EL  V1ERNES  AL  ANOCHECER  UN  AUTO  CHOCO  CONTRA 
UN  ARBOL. 

The  Spanish  generator  is  based  on  Goldman's  (1975)  CD-to-English 
generator.  Its  input  is  the  CD  diagram  presented  above;  its 
output  is  the  Spanish  sentence  (17).  The  generator  has  access  to 
the  full  meaning  representation  of  the  accident  story  produced  by 
SAM  in  its  script  application  phase,  including  the  accident  script 
and  any  other  scripts  referenced  therein.  Neither  the  original 
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English  sentence  (4a)  nor  any  of  the  original  English  words  are 
accessible  to  the  generator.  It  has  no  need  for  them,  as  SAK  has 
already  extracted  all  the  relevant  meaning  from  the  input  text. 

The  generator  is  action-oriented.  Upon  receiving  a 
conceptualization,  it  must  first  decide  on  the  apprcpiate 
verb-sense  in  the  target  language  that  best  expresses  the  action 
cr  change  of  state  in  the  input  conceptualization.  A verb-sersr 
is  a semantically  unambiguous  entity  associated  with  a particular 
verb  ir.  the  target  language.  There  may  be  many  verb-senses 
associated  with  a single  verb,  one  for  each  different  meaning  (or 
usage)  of  the  verb.  "Takel"  and  "take!;",  for  example,  are  two 
senses  of  the  verb  "take".  "Takel"  means  to  take  possession  cf  an 
object,  as  in  "John  took  the  book  from  Mary,"  and  "take2"  means  to 
take  someone  or  something  tc  a new  location,  as  in  "t'.ary  took  John 
to  his  house".  The  reason  for  selecting  a verb-sense  is  that  cr.ce 
the  verb-sense  is  chosen,  there  are  severe  semantic  and  syntactic 
constraints  placed  upon  the  linguistic  cases  of  the  verb.  These 
constraints  are  usually  strong  enough  to  specify  a unique  mapping 
from  the  conceptual  cases  in  the  CD-diagram  to  the  linguistic 
cases  in  a grammatical  case  structure  generated  for  the  target 
language . 

Once  the  linguistic  case-structure  is  generated,  a grammar 
for  the  target  language  (encoded  as  an  Augmented  Transition 
Network:  Woods  1970,  Simmons  and  Slocum  1970)  is  applied  to 
generate  its  surface  linguistic  realization.  We  shall  focus  our 
attention  on  the  verb-sense  selection  and  related  processes,  as 
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the  grammatical  generation  stages  are  relatively  well  understood. 
Further  details  on  the  linguistic  aspect  of  generation  can  be 
found  in  (Goldman  1975). 

We  mentioned  in  Section  2.2  that  there  are  three  possible 
Spanish  translations  of  the  English  verb  "to  hit",  only  one  of 
which  is  correct  in  a given  context.  The  CD-representation  (17) 
gives  access  to  all  the  information  necessary  to  choose  the 
appropriate  Spanish  verb  sense.  The  CD-ACT  PROPEL  defines  a set 
of  possible  verb-senses.  Selecting  a member  of  this  set  is 
accomplished  using  a discrimination  network  (D-net). 

The  simplest  example  of  a D-net  is  a binary  decision  tree, 
lhe  root  node  of  the  PROPEL  D-net  asks  whether  the  ACTOR  of  the 
PROPEL  is  human.  If  the  answer  to  this  question  is  YES,  then  we 
proceed  to  the  left-hand-descendant  of  the  root  node;  if  the 
answer  is  NO,  as  in  (17)-,  ,we  proceed  to  the  right-hand-descendant. 
This  node  asks  whether  the  OBJECT  is  self-PROPELled . In  (17),  the 
answer  is  YES;  hence  we  proceed  to  the  next  left-hand-descendant . 
The  new  node  tests  whether  the  OBJECT  is  much  smaller  than  the 
ACTOR.  The  answer  is  NO  and  the  right-hand  descendant  is  tested, 
lr.  this  manner,  we  determine  that  the  ACTOR  is  self-propelled  and 
not  human;  the  OBJECT  is  not  particularly  fragile.  (The  latter 
information  comes  not  from  (17),  but  from  a query  to  PP-Memory 
about  the  properties  of  trees.)  At  this  point,  we  reach  a 
leaf-node  in  the  tree,  that  is,  not  a node  containing  a question 
whose  answer  directs  the  program  onward  into  the  tree,  but  one 
which  yields  the  unique  verb  sense  CHOCAR.  Figure  3 is  the  full 
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discrimination- net  for  Spanish  verb  senses  expressing  the 
conceptual  act  PROPEL. 
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Discrimination  net  for  Spanish  verbs  expressing  PROPEL  CD's. 
ACTOR  X < = = = > PROPEL  <— o—  X < — 1—  Z 

I 

/ 

I 

I 

MODE  Vi  [encodes  force  and  intentional ity] 

•START* 


Y 

1 

1 

Is  ACTOR 

Human?  -- 

NC 

1 

1 

1 

1 

Is  ACTION 

1 

1 

Is  ACTOR 

YC  — intentional 

? — N 

Y — 

self-propelled 

! 

j 

1 

1 

1 

1 

1 

! N 

Is  OBJECT 

Is  size 

1 

of  OBJECT 

1 

Is  ACTOR 

animate? 

much  smaller  than 

— gravity?-- 

i i 

i i 

: : n 

size  of 

ACTOR? 

• i 

i i 

1 

1 

1 

I 

Y ! ! 

i i 

• i 

i i 

i i 

Y ! 

1 

1 

N ! 

1 

J 

1 1 

1 1 

1 1 

! 1 

Instrument  Instrument  APLASTAR 


Is  OBJECT 


used?  — 

used? 

— 

! — ground?  ! 

1 

1 

1 

CHOCAR  : 

1 

y : n 

! Y 

! N 

Y 1 

N Is  force 

1 

1 

1 

1 

1 

very  small? 

PEGAR1 

1 

G0LPEAR1 

1 

1 

EMPUJAR 

TERREMOTO 

1 1 

1 1 

! Y IN 

CAER  ! ! 

Instrument  part 
of  body  of  ACTOR?- 


ROZAR  ! 

I 

I 

G0LPEAR2 


Y — Sharp? 


Y — Foot?  — N 


PATEAR  PEGAR2 


APUNALAR  GOLPEAR1 


Y - yes,  YC  - yes  or  cannot  tell, 
N - No,  NC  - no  or  cannot  tell. 

[ Figure  3 ] 
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Each  question  in  the  D-net  divided  the  set  of  possible  verb 
senses  for  PROPEL  into  two  (roughly  equal)  sets.  The  process  was 
repeated  for  each  new  set  until  the  working  set  contained  only  one 
candidate  (i.e.  the  D-net  reached  a leaf-node).  The  leaf  node 
CHOCAR  defines  a unique  mapping  of  the  conceptual  cases  from  ( 1 6 ) 
into  the  linguistic  case  structure  of  (17).  The  Spanish  grammar 
linearizes  the  linguistic  case-structure  representation,  adding 
the  adverbial  time  clause  in  the  appropriate  place,  and 
substituting  the  Spanish  lexical  items  for  "small  motorized 
vehicle"  and  "tree". 


The  second  conceptualization  to  be  expressed  in  the  summery 
is  the  "interfering"  event,  EVNT14,  from  the  story  representation: 


l 16)  [David  Hail,  27,  a New  Jersey  Man,  died] 

((ACTOR  (^PERSON  PERSNAME  (DAViE) 

LASTNAME  (HALL) 

AGE  ( #UNIT  TYPE  ( "YEAR* ) 

AMOUNT  (27) 

TOKEN  (UNITO)) 

RESIDENCE  (^POLITY  TYPE  ( *STATE* ) 

POLNAME  (NEW  JERSEY) 
TOKEN  ( POLITO ) ) 

GENDER  (*MASC*) 

TOKEN  (KUMO)) 

TOWARD  (•HEALTH*  VAL  (-10)) 

FROM  ("HEALTH*  VAL  (*N0RM*) ) ) 

TIME  ((  BEFORE  *N0W*  X)) 

(19)  DAVID  HALL,  DE  27  ANOS,  QUE  VIVIA  EN  NUEVA 
JERSEY,  SE  MURIO. 

The  Spanish  verb  sense  M0R1R2  is  chosen  by  applying  the  D-net 


associated  with  the  HEALTH  state  to  conceptualization  (18)  in  the 
same  manner  as  (16).  The  conceptual  ACTOR  case  is  mapped  into  the 
linguistic  AGLNT  case.  We  mentioned  in  Section  1 that  translating 
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the  age  and  residence  would  present  severe  problems  to  a system 
that  did  not  recognize  them  as  such.  Once  the  CD  diagram  (16)  has 
been  created,  some  of  the  difficult  problems  are  automatically 
resolved;  others  are  greatly  simplified.  For  instance  the  "27" 
in  Story  5 was  resolved  as  Hall's  age  in  years  by  the  Conceptual 
Analyzer,  ELI.  The  fact  that  sufficient  information  is  provided 
beforehand  makes  the  task  of  the  Spanish  phrase  generator 
straightforward.  It  maps  "AGE  (<IUNIT  TYPE  (•YEAR*)  AMOUNT  (27))" 
into  "DE  27  ANOS" , since  a person's  age  can  always  be  expressed 
the  same  way  in  Spanish. 

The  problem  of  expressing  "a  New  Jersey  mar"  in  Spanish  is 
greatly  simplified  by  the  representation  "RESIDENCE  (//POLITY 
POLTYPE  (“STATE* ) PGLNAME  (NEW  JERSEY))."  The  generator  must 
decide  whether  the  reference  is  to  Hall's  residence,  birthplace, 
present  location  or  destination,  since  each  of  these  alternatives 
would  require  a different  Spanish  expression.  The  grammatical 
construction  that  needs  to  be  generated  in  Spanish  differs  from 
the  original  English  expression.  The  former  is  an  appositive  noun 
phrase,  while  the  latter  is  a relative  clause.  Given  that  Spanish 
normally  expresses  residences  as  relative  clauses,  we  need  to 
determine  the  verb  of  the  relative  clause  and  its  proper 
conjugation.  A D-net  is  used  to  determine  whether  it  is  known 
that  David  Hall  owns  the  house  he  lives  in,  rents  an  apartment,  or 
was  temporarily  passing  through.  If  the  first  were  true,  we  would 
say  the  equivalent  of  "Hall  owns  a residence  in  NJ";  if  the 
second  were  the  case,  we  would  generate  something  like  "Hall  rents 
in  NJ",  etc.  Since  we  have  no  information  to  allow  us  to  answer 
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any  of  the  above  questions,  we  resort  to  the  most  general  verb 
expressing  the  idea  of  residence,  namely  VlVlh2,  meaning  "to  live 
in". 


The  conjugation  of  this  verb  is  determined  by  the  subject  of 
the  sentence,  and  the  proper  time  modality.  Determining  the  time 
modality  involves  determining  when  it  was  true  that  Hall's 
residence  is/was  in  New  Jersey.  This  involves  consulting  the 
entry  in  PP-Memory  for  David  Hall  (accessed  through  the  token 
HUhO)  created  during  the  understanding  process.  We  find  that  his 
HEALTH  is  at  -10,  meaning  he  has  died.  Therefore,  he  can  no 
longer  "live"  at  his  New  Jersey  residence,  and  we  conjugate  V1V1R2 
in  the  past  tense  . 

We  could  have  checked  the  conceptualization  being  generated 
to  see  whether  Hall  was  still  a resident  of  New  Jersey.  This 
would  also  have  yielded  the  right  answer  in  this  instance.  lr. 
general,  however,  the  current  conceptual ization  doesn't  contain 
the  data  that  is  needed.  Consider,  for  example,  translating; 
"George  Washington,  a Virginia  man,  led  the  Continental  Army." 
here,  only  the  PP-Memory  entry  for  George  Washington  can  tell  us 
that  he  can  no  longer  be  a resident  of  Virginia. 

The  next  summarizing  conceptualization  to  be  expressed  is  the 
second  "interfering"  event  from  Story  5: 
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(20)  [Frank  Miller,  the  driver,  was  slightly  injured] 

((ACTOR  (^PERSON  PERSNAME  (FRANK) 

LASTNAKE  (MILLER) 

GENDER  ( #MASC* ) 

SROLES  (UVEHACCIDENT  . &HURTGRF) 

( $DRIVE  . 4DRIVER1 ) ) 

TOKEN  (HUMS)) 

TOWARD  (•HEALTH*  VAL  (NIL)) 

FROM  (*HEALTH*  VAL  (*NORM*) ) ) 

INC  (-3) 

TIME  ((BEFORE  •NOW*  X))) 

(21)  EL  CHOFER,  FRANK  MILLER,  QUEDO  UN  POCO  HERIDO . 

To  select  the  Spanish  verb- sense  QUEDAR3,  the  generator,  as 
before,  applies  a state-change  D-net.  The  CD  representation  (20; , 
however,  dees  not  contain  all  the  necessary  information  for 
choosing  the  appropriate  verb-sense.  One  of  the  questions  asked 
by  the  D-net  is  whether  the  change  in  Miller's  health  occurred  as 
a result  of  his  scriptal  ROLE;  that  is,  did  Miller's  getting  hurt 
have  anything  to  do  with  his  being  the  driver  of  a vehicle?  This 
question  is  answered  by  a program  that  searches  the  instantiated 
$VEKACCIDENT  Script.  This  finds  a causal  connection  in  the  story 
representation  between  the  crash,  EVNTM,  and  the  event  of  Miller's 
injury,  EVNT20.  Therefore,  QUEDAR3  is  chosen.  The  negative 
INCrement  in  the  HEALTH  scale  and  its  final  value  (3  points  below 
normative)  yields  the  Spanish  adjective  HERIDO.  The  small 
magnitude  of  the  increment  tells  the  generator  to  modify  the 
adjective  with  UN  POCO,  meaning  "a  little".  Example  (21)  is  thus 
generated  in  a straightfoward  manner,  aside  from  the  necessary 
query  to  the  instantiated  script. 
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The  final  event  passed  by  the  summarizer  to  the  generator  for 
expression  is  the  "scorecard"  event  (EVNT33)  which  records  what 
the  police  decided  to  do  as  a result  of  the  accident: 


(22)  [The-  policeman  in  charge  did  not  press  charges] 

((ACTOR  (^PERSON  SURNAME  (ONOFRIO) 

TITLE  (PATROLMAN) 

GENDER  ( •MASC# ) 

TOKEN  (HUM?) 

SROLE  (UVEHACC1DENT  . &PCLSTAFF  ) ) ) 

<=>  («MBU1LD«) 

MOBJECT  ((ACTOR  HUM? 

< = > ( {PROSECUTION  CHARGEE  (NIL) 

CHARGOBJ  (NIL) 

CHARGER  (^ORGANIZATION 

ORGuCC  UPOLICE) 
TOKEN  (0RG3))) 

MODE  (•NEC*  «TS») ) ) 

TIME  ((BEFORE  •NOW*  X)) 

(23)  LA  P0L1C1A  NO  ACUSO  A NADIE . 

The  Spanish  generator  in  SAM  contains  certain  heuristics  to  avoid 
"verbose"  expressions  for  events.  One  of  these  applies  tc 
conceptualization  (22): 


Generation  Rule  1: 

If  the  CD  to  be  expressed  matches  the  pattern 
(ACTOR  X <=>  MBUILD  MOBJECT  Y) 
and  Y is  an  action  with  ACTOR  = X,  MODE  = NEG, 
then  express  instead  CD  Y. 


This  rule  essentially  states  that  persons  deciding  not  tc  do 
something  usually  do  not  change  their  minds:  "John  decided  not  to 
go"  vs.  "John  didn't  go."  The  generator  extracts  the 
conceptualization  in  the  MOBJECT  of  (22)  (i.e.,  the  decision 
arrived  at  by  the  policeman),  and  attempts  tc  express  it. 
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At  this  step,  it  runs  into  a problem  not  encountered  in  the 
first  three  examples:  there  is  no  correct  way  to  directly  express 
the  sub-conceptualization,  which  in  English  means  "the  police  did 
not  initiate  prosecution  (file  charges)."  D-nets  are  attached,  in 
general,  to  CD  ACls  or  STATES.  In  CD  diagrams,  ACTs  appear  to  the 
right  of  a "<=>".  The  generator  discovers  that  there  is  no  way  to 
directly  express  (22)  when  it  finds  no  D-net  indexed  under 
$PROSECUTION . 


Since  {PROSECUTION  is  a Script,  it  applies  the  following 
generation  heuristic: 


Generation  Rule  2: 

IF  there  is  no  D-net  attached  to  a Script  name, 

THEN  IF  MODE  r NEG 

THEN  search  the  Script  in  temporal  order 
for  the  first  event  that  is  a 
"precondition"  for  the  rest  of  the 
Script  to  take  place.  Bind  the  Script 
roles  in  this  event,  add  MODE  NEG,  and 
generate  that  event 

ELSE  retrieve  the  Script  Maincon,  bind  roles 
and  generate. 

Applying  this  rule,  the  generator  searches  the  {PROSECUTION  script 
(as  it  exists  in  the  minds  of  many  Spanish  speakers),  and  finds 
that  its  first  scene  describes  the  police  deciding  whom  to  accuse 
for  the  alleged  crime.  This  event  must  be  completed  before 
succeeding  events  in  {PROSECUTE  can  take  place  (e.g.,  the  police 
must  have  a suspect  before  making  an  arrest  and  proceeding  to 
initiate  a trial).  Since  the  search  is  successful,  the  generator 
adds  MODE  ((•NEG*))  (meaning  that  the  event  did  not  take  place). 
The  negated  accusation  event,  represented  below,  is  easily 
generated  in  Spanish. 
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(24)  [Police  do  not  accuse  someone] 

‘ j 

((ACTOR  ^ORGANIZATION  ORGOCC  ($POLICE) 

TOKEN  (ORG;)) 

<=>  ( *MTRANS* ) 

MOBJECT  ((<=>  ( $ACCUSAT10N  CHARGOBJ  (NIL) 

, J CHARGER  ORG3 

| CHARGEE  (NIL)))) 

FROM  ( * CP*  PART  ORG3) 

j TO  (•CP*  PART  (NIL))) 

MODE  ((»NEG»))) 


Using  the  D-net  attached  to  the  ACT  MTRANS,  the  Spanish  generator 
selects  the  verb-sense  ACUSAR1,  and  subsequently  generates 
sentence  (23) . 
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In  this  section  we  address  the  crucial  issue  of  whether 
"full"  understanding  is  necessary  for  the  correct  translation  of  a 
text  from  one  language  into  another  (and,  if  not,  just  how  much 
understanding  is  needed).  By  full  understanding,  we  mean  the 
mapping  of  a text  into  some  semantic  representation  adequate  for 
competent  question  answering. 


4.1  Translation  with  Minimal  Understanding 

Perhaps  the  most  common  attitude  of  past  and  present  MT 
researchers  is  expressed  in  the  following  excerpt  from  (Kittredge 
et.  al.  1976): 

...even  if  lexical  decomposition  proves 
theoretically  possible,  it  is  far  from  clear  that  it 
will  prove  efficient  in  semantically  complex  subject 
areas.  Thus  the  designers  of  heavy-duty  translation 
systems,  those  which  must  actually  produce  readable 
translations  of  texts  formed  over  an  extensive 
vocabulary,  are  more  or  less  faced  with  the  necessity 
of  leaving  the  lexical  unit  Intact.  Translation  then 
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requires  relating  the  lexical  units  of  the  two 
languages  within  the  framework  of  empirically 
established  syntactic  structures,  and  the  inventory 
ana  use  of  such  structures  invariably  differs  between 
lang uages. 

The  first  sentence  of  the  above  passage  states  that  understanding , 
even  if  possible,  may  be  inefficient.  Therefore,  the  authors 
conclude,  that  a practical,  "heavy-duty"  translation  system  will 
consist  of  a dictionary  of  the  source  language  in  which  every 
word-entry  will  contain  instructions  on  how  to  translate  it  into 
the  target  language.  These  instructions  will  use  such  information 
as  the  syntactic  structure  of  the  sentence,  the  syntactic  role  of 
the  word  in  the  sentence , and  the  constraints  the  corresponding 
expression  in  the  target  language  places  on  the  structure  of  the 
resulting  sentence.  Thus,  the  English  verb  OCCUR  is  translated 
into  French  as  SE  RENDRE  COMPTE  if  the  second  complement  is 
present,  as  is  the  case  in  IT  OCCURRED  TO  RAY  THAT  MAX  MIGHT  HAVE 
LIED.  The  entry  for  OCCUR  must  also  specify  that,  if  the  above 
translation  has  been  chosen,  then  the  translation  of  the  first 
complement  (RAY  in  our  example)  should  become  the  subject  of  the 
verb  SE  RENDRE  COMPTE. 


The  above  rule  is,  of  course,  a simplification  of  Kittredge 
et  al.'s  scheme.  In  practice,  the  input  is  first  converted  into 
some  tree-like  structure  that  explicates  some,  but  not  all,  of  the 
underlying  syntactic  relations  of  the  source-language  sentence. 
The  structure  focuses  on  those  relations  that  are  considered 
relevant  for  the  task  of  translation  between  the  particular  prir 
of  languages.  Sometimes  these  structures  are  annotated  with  some- 
rudimentary  semantic  information.  At  the  second  stage,  these 
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structures  are  transformed  into  the  corresponding  "syntax-tree" 
structures  of  the  target  language,  from  which  the  output  is  then 

generated . 

The  rules  governing  this  t. ansfermation  are  organised  in  what 
is  called  a transit i gragm^r  (Kittredge  et  al . 1976, 

l 

Nedobejkine  1976,  Boitet  1976).  The  characteristic  features  of 
this  approach  are:  (1)  there  us  no  "understanding"  of  the  input 
text,  since  these  systems  are  in  principle  unable  to  answer  even 
the  simplest  questions  about  the  input;  and  (2)  all  structures 
and  rules  used  in  the  translation  are  established  "empirically" 
for  each  pair  of  languages,  and  thus  are  necessarily  different  for 
each  pair.  Moreover,  the  word  "empirically"  often  reflects  the 
researcher's  ingenuity  in  finding  ad  hoc  heuristics,  rather  than 
any  principles  of  language  organization  and  use.  It  is  clear  that 
this  approach  to  FAHQ1  (Fully  Automatic  High  Quality  Translation  - 
Bar-Hillcl's  terminology)  is  inadequate,  as  has  been  argued 
repeatedly  in  the  last  18  years,  beginning  with  (Bar-Hillel  I960) 
and  continuing  with  the  examples  in  Section  2. 

In  recent  years,  efforts  have  been  made  in  two  directions  to 
overcome  the  inadequacies  of  the  above  approach:  (1)  more 
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extensive  use  of  semantics,  and  (2)  relaxation  of  the  requirements 
for  FAHQT.  Let's  examine  each  of  these  directions  in  turn. 
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H.2  A Little  More  Semantics 


The  main  objection  to  Ml  models  which  try  to  "understand"  is 
that  this  task  is  so  complex  as  to  lead  necessarily  to  inefficient 
Ml.  Thus,  some  MT  researchers  propose  the  use  of  as  "little 
understanding"  as  is  needed  for  an  adequate  translation  between  a 
given  pair  ol  languages.  Wilks  (197?)  proposes  a specialized 
intermediate  representation  system  for  every  pair  of  languages. 
This  representation  system  is  basically  an  extention  of  the  source 
language  (English  in  this  case),  which  includes  all  the 
information  necessary  for  the  correct  translation  of  the  input 
into  the  target  language  (French).  Each  meaning  of  a word  in  the 
source  language  is  described  by  a special  data  structure,  called  a 
formula,  and  each  formula  has  a set  of  functions  associated  with 
it  that  chocse  the  appropriate  target  language  expression.  These 
functions  can  test  the  structures  in  the  intermediate 
representation  of  the  text.  Thus,  in  some  sense,  each  English 
word  in  this  schema  has  a set  of  specialized  routines  which  "know" 
hew  tc  translate  the  word  into  French.  In  "hardship"  cases,  when 
the  information  necessary  for  correct  translation  is  not 
explicitly  mentioned  in  the  source  text,  Wilk's  system  would  call 
the  "real"  understander  (never  implemented).  This  understander 
would  have  to  translate  the  intermediate  representation  into  a 
"real"  meaning  representation,  make  the  necessary  inferences,  and 
then  translate  the  results  back  into  the  intermediate  language. 
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certain Ml  researchers  in  the  Grenoble  group  (Boitet  1976, 
Nedobejkine  1976)  take  a similar  approach,  retaining,  however,  the 
transfer  stage  of  the  process.  They  propose  several  levels  of 
transfer  corresponding  to  different  levels  of  the  system’s 
syntactic  and  semantic  sophistication.  Such  a system  attempts  tc 
translate  the  input  on  the  lowest  possible  level.  If  this  is 
inadequate,  it  goes  to  a higher  level.  As  in  Wilks's  system,  the 
structures  and  transfer  functions  need  to  be  defined  for  each 
different  pair  of  languages. 

14.3  Relaxing  Requirements 

The  conclusion  that  one  can  draw  from  our  examination  of 
these  efforts  is  not  surprising.  The  more  semantics  one  puts  in 
the  system  - the  better  (but  pcssibly  slower)  it  works.  Tr.is 
conforms  to  Bar-Hillel's  original  assertion  that  FAHOT  is  possible 
only  with  full  understanding.  This  observation  leads  us  back  tc 
the  second  item  in  our  discussion:  perhaps  by  relaxing  some  of 
the  requirements  for  FAHQT  we  can  find  a reasonable  solution  tc 
the  efficiency  problem  in  MT.  But  what  does  it  mean  tc  relax 
requirements  for  FAHQT?  In  the  early  days  of  MT,  this  meant 
translations  which  were  smoothly  readable  but  not  always  correct. 
The  slogan  in  those  days  was  "95%  accuracy".  This  is  clearly 
naive  and  misleading.  What  does  it  mean  for  a human  translation 
to  be  95%  accurate?  Over  what  class  of  texts  do  we  make  these 
measurements?  Even  a very  good  UN  translator  is  likely  to  be 
helpless  at  a mathematics  congress. 
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The  inadequacy  of  this  approach  was  discussed  at  length  by 
Bar-hilltl  (I960).  However,  the  main  objection  to  it  was  not  the 
emptiness  of  the  criterion,  but  rather  the  fact  that  the  reader  of 

such  a translation  would  not  be  able  to  detect  the  mistakes.  A i 
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similar  fear  of  undetectable  mistakes  was  voiced  in  (Hoffman 
1976).  But  isn't  this  the  case  with  human  translators  as  well? 

Even  good  human  translators  sometimes  misunderstand  the  input  or 
chocse  an  inappropriate  word  or  expression,  making  it  difficult 
for  the  reader  to  find  the  mistake.  Why  should  one  apply 
different  criteria  to  human  than  to  machine  translations?  It 
seems  that  most  MT  researchers'  basic  distrust  of  their 
"heuristic"  methods  leads  them  to  a distrust  of  their  computer 
programs.  For  example,  what  Bar-Hillel  was  in  fact  objecting  to 
in  his  I960  paper  was  the  use  of  "frequency"  methods  to  achieve  a 
smooth,  readable  output. 

One  must  apply  critera  based  on  human-level  translation,  or 
the  whole  idea  of  "Fully  Automatic  High  Quality  Translation" 
becomes  vacuous.  This  leads  to  the  replacement  of  FAKQT  with  a 
MACHINE-MAN  partnership  in  which  the  computer  supplies  a specially 
trained  human  editor  with  all  translations  of  all  the  words  in  the 
input  sentence,  with  suggestions  for  a few  possible  versions  of 
the  target  sentence.  In  the  best  case,  the  suggestion  would  be 
unique  and  grammatical.  The  editor  would  then  compile  the  "real" 
translation  in  the  target  language.  Alternatively,  the  users 
themselves  would  have  to  learn  how  to  read  and  make  sense  of  such 
computer  output.  Thus,  Josselson  in  his  1970  survey  of  MT  wrote: 
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Since  it  is  quite  obvious  that  the  product  of  any 
functioning  MT  system  will  never  look  the  same  as 
human  translation,  man  will  alsc  have  to  expend  some 
effort  in  reading  the  output.  This  process  will, 
however,  require  less  effort  than  the  acquisition  of 
reading  skill  of  another  language... 

Such  a computer  tool  might  be  useful,  but  no  one  would  call  it 

"translation"  in  any  usual  sense  of  the  word. 

M.U  The  Problem  of  Reliability 

We  believe  that  the  output  of  an  MT  system  should  be 
comparable  to  a reasonable  human  translation.  Tc  make  this 
possible,  the  MT  system  must  have  a reasonable  ability  to 
understand  what  it  is  translating.  But  how  can  one  be  sure  that 
translations  produced  by  computer  are  "reliable"?  We  take  the 
standpoint  that  this  is  equivalent  to  answering  the  question:  How 
can  we  be  sure  that  human  translations  are  reliable?  Our 
experience  with  ar,  individual  translator  can  only  tell  us  that, 
when  he  understands  a text,  he  can  usually  recreate  reliably  its 
content  in  another  language.  We  cannot  expect  even  a good 
translator  to  translate  reliably  a text  in  an  unfamiliar  area. 
One  car.  verify  the  accuracy  of  human  translations  in  two  different 
ways:  Compare  the  results  of  two  different  translators,  or  ask 
the  translator  to  expand  the  meaning  of  the  original  text,  when 
there  is  doubt  as  to  the  accuracy  of  a particular  passage.  The 
latter  method  is  preferred  for  real-time  human  translation.  As  an 
example  of  this  mode  of  translation,  consider  the  highly 
publicized  error  in  English-Polish  translation  that  occurred 
during  President  Carter's  recent  trip  to  Poland.  Carter  stated 


" r*\  m 


H6  - 


that  people  in  The  United  States  "desired  to  have  closer  ties  to 
the  Polish  people".  The  sleepy  translator  expressed  the  Polish 
equivalent  of  The  United  States  "lusted  after  the  Polish  people." 
On  a lexical/syntactic  level  it  is  difficult  to  determine  why  this 
translation  was  incorrect.  But,  if  asked,  the  translator  would 
quickly  have  realized  his  error,  and  explained  what  the  American 
President  had  really  meant.  Thus,  we  establish  a criterion  for 
reliable  translation:  The  translator  must  be  able  to  answer 
questions  about  the  meaning  of  the  source  text  and  elaborate  the 
target  text  to  make  the  meaning  clear  to  readers.  This  criterion 
is  met  by  our  programs,  since  they  can  answer  at  least  certain 
questions  about  things  they  car.  understand. 

M.5  Depth  of  Unaerstanding  ana  Reliability  of  Translation 

A possible  objection  to  our  schema  for  MT  is  that  some 
limited  form  of  translation  is  possible,  although  the  translator 
may  fail  to  fully  understand  the  source  text.  For  example,  one 
can  imagine  a person  without  deep  medical  training  translating  a 
highly  professional  conversation  between  two  doctcrs  from 
different  countries.  In  this  scenario,  translation  is  only 
possible  if  the  doctors  refrain  from  using  technical  jargon.  Our 
claim  is  not  that  such  a translation  would  be  impossible,  but  that 
it  would  be  unreliable . We  have  argued  that  people's  ability  to 
produce  a reliable  translation  of  a text  is  limited  by  their 
ability  to  understand  it.  The  interpreter  in  the  above  example 
would  have  to  have  some  model  of  the  medical  universe,  however 
naive  and  unsophisticated,  to  produce  any  kind  of  coherent 
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translation  at  all.  The  reliability  of  his  translation  would 
depend  on  how  closely  his  model  corresponds  to  the  real'  model 
shared  by  the  two  professionals. 

Let's  consider  an  example  in  which  "common  sense" 
understanding  (i.e.,  equivalent  to  that  of  an  ordinary  person 
rather  than  a specialist)  would  be  sufficient  to  generate 
reasonable  translations  of  these  newspaper  headlines: 

(25a)  Israel  seized  large  quantities  of  new  weapons  from  the 
USSR . 

(25b)  Israel  seized  territory  from  Syria. 

To  produce  accurate  Russian  translations  of  (25a  and  b) , a 
translator  needs  to  understand  only  that: 

1)  The  weapons  were  seized  not  from  the  Soviet  Union,  but 
presumably  from  a neighboring  Arab  country. 

2)  The  weapons  were  manufactured  in  the  Soviet  Union. 

5)  The  territory  was  indeed  seized  from  ( not  manufactured  by) 
Syria . 

In  (2 5a),  a Russian  would  make  these  distinctions  by  preceding  the 
name  of  the  country  by  the  preposition  "iz"  while  in  (25b)  he 
would  use  the  preposition  "u". 

A more  comprehensive  understanding  of  (25),  while  possible, 
is  not  necessary  for  the  purpose  of  ordinary  translation.  For 
instance,  a political  scientist  might  consider  the  US  reaction  to 
(25)  as  part  of  his  understanding  process,  but  this  goes  beyond 
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the  minimal  understanding  necessary  to  answer  questions  about  what 
actually  happened  in  (25).  While  producing  a detailed  political 
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analysis  of  a situation  is  beyond  the  capabilities  of  our  computer 
programs  (though  POLITICS  has  produced  some  preliminary  results  in 
this  area;,  they  are,  nevertheless,  able  to  produce  a translation 
roughly  equivalent  to  that  of  a human  with  a Similar  "factual" 
knowledge  base . 

Translation  can  be  achieved  on  various  levels  of 
understanding,  from  extremely  primitive  to  very  advanced.  For 
example,  a program  called  FRUMP  (DeJong  1977)  has  been  developed 
at  Tale  which  uses  a knowledge  structure  called  a "sketchy  script" 
to  process  a great  variety  ol  news  items  taken  directly  from  the 
UP!  wire.  FRUMP  produces  simple  one,  two  or  three  sentence 
summaries  in  English,  Dutch,  French,  Spanish,  and  Russian. 
FRUMP's  understanding  and  expressive  abilities  are  limited,  but 
the  program  works  very  quickly  and  can  answer  questions  about  what 
it  has  understood.  It  simulates  a person  skimming  through  a 
newspaper . 

One  might  argue  that  this  is  not,  strictly  speaking, 
translation.  Nevertheless,  this  is  what  people  often  do  with 
foreign  language  newspapers,  and,  therefore,  FRUMP  models  a 
minimal  order  of  understanding  as  applied  to  MT.  It  is  also  a 
useful  tool  in  reducing  the  enormous  flow  of  information  in 
different  languages  that  humans  have  to  handle.  The  SAM  system, 
described  in  Section  3,  is  the  best  developed  understanding  system 
at  Yale  to  date.  It  represents  a middle  range  of  sophistication 
among  our  systems.  Currently,  work  is  in  progress  on  exploring  MT 
in  intelligent  computer  systems  such  as  PAM  (Wilensky  1976)  and 
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POLITICS  ( Carbone  1 1 1 97fc ) . 


J*.6  "Professional"  vs.  "Ordinary"  Translation 


Our  basic  approach  to  MT  in  all  these  systems  is  what  might 
be  called  a "retelling"  ol  the  text  in  a different  language, 
rather  than  a translation  in  the  professional  sense  of  the  word, 
with  its  close  correspondence  to  the  structure  and  style  of  the 
original.  We  are  not  trying  to  simulate  a professional 
translator.  The  ability  to  translate  professionally  is  a highly 
specialized  skill,  which  requires  many  years  cf  training  in 
addition  to  a particular  talent.  Instead,  we  are  trying  tc 
simulate  an  average  non-professional  person  with  a working 
knowledge  of  two  languages  who  is  asked  to  read  a text  and  then  to 
reproduce  its  content  at  a desirable  level  of  detail.  As  far  as 
the  professional  skill  is  concerned,  we  believe  that  it  is  based 
on  this  more  "mundane"  ability  to  translate. 


Thus,  when  a good  translator  hears  an  intentionally  ambiguous 
expression,  e.g.,  in  a political  speech  or  a literary  composition, 
he  will  first  find  both  meanings,  analyze  why  the  speaker  used  the 
ambiguous  expression  in  the  first  place,  and  then  search  for  an 
expression  in  the  target  language  which  preserves  the  ambiguity. 
Ambiguity  on  purely  syntactic  and  lexical  grounds  is  never 
preserved  for  its  own  sake,  as  semantic  considerations  imposed  by 
the  context  will  invariably  rule  out  spurious  interpretations. 
Attempts  to  preserve  lexical  and  sytactic  ambiguities  in  an  MT 
system  without  proper  understanding  of  the  source  text  must  often 
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produce  unreliable  translations. 

*>.7  A Few  Words  about  Efficiency 

Finally,  some  comments  on  the  question  of  efficiency  of  MT 
systems  need  to  be  made.  One  of  the  main  objections  to  the 
"understanding-first"  schema  of  KT  is  its  alleged  inefficiency. 
In  answer  to  this,  one  can  only  say  that,  as  is  the  case  with 
humar.s,  efficiency  depends  on  the  desired  quality  of  translation. 
FRUMP  works  very  quickly  and  "efficiently".  SAM  works  mere 
slowly,  but  it  produces  better  translations.  Our  experience  with 
SAM,  however,  suggests  that  the  increase  in  SAM's  understanding 
abilities  does  not  lead  to  any  serious  decrease  in  its  speed.  The 
combinatorial  "explosion"  feared  by  Boitet  (1S76)  has  not  come  tc 
pass.  SAM  uses  a hierarchical  representational  scheme,  so  that 
processing  time  grows  by  a less  than  linear  function  of  the 
increased  information  in  the  system.  However,  in  a syntax-based 
system  that  allows  several  rewrite  rules  to  be  applicable  at  any 
one  time  and  generats  many  alternative  syntactic  parses  (or 
transformations)  of  a sentence,  the  combinatorial  explosion  in 
processing  time  (and  space)  as  a function  of  the  number  of 
applicable  rules  is  indeed  a serious  consideration. 

A different  inefficiency  problem  arises  in  transfer  grammars 
and  in  Wilks'  translation  paradigm.  Such  systems  require 
specialized  routines  to  translate  between  every  pair  &£  languages. 
The  number  of  pairs  of  languages  increases  as  the  square  of  the 
number  of  languages.  Hence,  if  one  wishes  to  translate  among  11 
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different  languages  one  would  need  110  transfer  grammars!  In  our 
understanding  paradigm,  source  text  is  mapped  into  a language- free 
representation,  and  that  representation  is  mapped  into  the  target 
language.  This  process  would  requires  11  language  analyzers  and 
11  generators,  for  a total  of  22  routines.  Moreover,  the  memory 
model  mediating  between  analysis  and  generation  would  be  the  same 
in  each  case  (ignoring  small  cultural  differences).  Thus,  the 
extra  memory  space  required  to  add  another  language  would  be 
expected  to  increase  only  linearly  with  the  number  of  languages. 

We  are  not  claiming  that  we  have  solved  the  problem  of 
knowledge-based  Machine  Translation.  Far  from  it  - we  feel  that 
we  have  only  scratched  the  surface.  Three  different  classes  of 
problems  need  to  be  addressed  in  our  approach  to  MT : (1) 
understanding  the  input  text,  (2)  generating  a coherent,  readable 
text  in  a given  language  from  a conceptual  representation  of  the 
text,  and  (3)  generating  an  output  which  conforms  tc  the  standards 
of  professional  translation.  We  have  concentrated  mostly  on  the 
first  set  of  problems,  since  we  consider  them  critical  to  the 
whole  approach.  Our  generators  are  admittedly  primitive.  We  have 
consciously  chosen  to  attack  the  problem  of  "what  to  say",  rather 
than  the  problem  of  "how  to  say  it".  In  SAM  our  summarizers  work 
with  conceptual  memory  on  the  one  hand,  and  with  various 
natural-language  generators  on  the  other  to  produce  reasonable 
summaries  of  the  stories  SAM  has  understood.  We  are  only 
! '0  nning  to  work  on  the  second  set  of  problems,  and  we  do  not  yet 
have  a model  of  language  generation  which  is  comparable  to  the 
sophistication  of  our  models  of  understanding.  We  have  not  even 
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touched  the  third  group  of  problems. 

4.6  Concluding  Remarks 

Our  work  on  MT  grew  out  of  our  research  on  natural  language 
understanding,  human  memory  mechanisms  and  natural  language 
generation,  when  we  put  these  modules  together  we  got  our  first 
working  MT  system.  The  main  advantage  of  the 
"understanding-generation"  schema  of  MT  is  that  every  problem  can 
be  put  in  the  context  of  human  problem-solving,  and  attacked  by 
all  known  memory  and  inference  mechanisms.  Another  advantage  of 
this  approach  is  that  the  psychological  plausibility  of  the 
proposed  mechanisms  can  be  used  as  a guiding  principle.  But  more 
importantly,  this  approach  makes  the  work  on  MT  not  an  isolated 
effort  to  develop  a set  of  task  specific  heuristics  but  a part  of 
a general  investigation  of  language  and  thinking. 
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